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Description 

METHOD AND SYSTEM FOR 
OPTIMIZED INSTRUCTION FETCH 
TO PROTECT AGAINST SOFT AND 

HARD ERRORS 

BACKGROUND OF INVENTION 

[0001] This invention relates to the field of computerized data processing and, 
in particular, to a method and system for detecting error and 
implementing error correction during transfer of instructions from 
memory to a computer processor. 

[0002] Embedded systems typically have a computer processor and an 

attached instruction memory unit that feeds instructions to be executed 
by the processor. The instruction unit must minimize instruction access 
time by providing a fast path to stored code. If instructions are not 
provided at the rate in which they are executed, the processor will stall, 
and precious cycles will be wasted waiting for code. 

[0003] As ^bedded computer memory scales to smaller and smaller sizes, 
soft error rates have increased significantly. This is due in major part to 
the ever-increasing integration factor and decreasing feature 
dimensions. Unfortunately, soft errors can cause major problems in 
embedded systems. One example of particular concern is executing an 
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incorrect instruction. If a soft error mutates an instruction, problems 
running the gamut from corrupted data to system failure can arise. 
None of this is acceptable in most applications. Even worse yet, the 
error may not be immediately obvious, making detection and correction 
more difficult. 

[0004] An error correcting code (ECC) technique is generally used to improve 
data integrity. ECC involves adding a number of check bits to a data 
word in order to detect and correct one or more bits that have flipped 
while the data was stored. Unfortunately, ECC generally carries a 
penalty with its use. The data word and check bits must be decoded to 
determine if correction is needed and which bit to correct. Typically, one 
or more additional clock cycles are required in order to accomplish this 
correction, and ECC implementations generally suffer this performance 
penalty. However, since data integrity is crucial, the tradeoff in 
performance is made. 

* * 

[0005] An example of a prior art ECC mechanism for accessing instructions 
can be found in U.S. Patent No. 6,108,753, which employs an 
automatic retry when an ECC or parity error is detected. Unfortunately, 
this method may suffer a performance penalty because the processor 
cannot assume it will have a valid instruction in all cases. Therefore, the 
processor must wait for an error determination before deciding whether 
or not it can proceed. 

[0006] | n the error detection and correction system of U.S. Patent No. 
4,646,312, a parity error signal pauses the processor clock when 
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asserted. During this pause, ECC correction is done on correctable 
data, and the processor clock is restarted when corrected data is 
available. Though this method improves performance execution over 
waiting for corrected data each cycle, the error signal must be able to 
halt clocks before an invalid instruction is executed. However, adding 
the ability to halt clocks to a processor can seriously limit pipelining 
within the processor, translating to extended latency and lower 
performance. Further, the delay from halting the clock during ECC 
correction of a corrupted instruction is experienced regardless of 
whether or not the instruction in question is actually executed. This is 
due to the fact that the delay occurs during retrieval. 

SUMMARY OF INVENTION 

[0007] Bearing in mind the problems and deficiencies of the prior art, it is 
therefore an object of the present invention to provide an improved 
method and system for detecting error during transfer of data signals 
from a data memory, particularly instructions from an instruction 
memory, to a computer processor. 

[0008] It is another object of the present invention to provide a method and 
system for detecting error during transfer of data signals from a data 
memory that maximizes data integrity. 

[0009] A further object of the invention is to provide a method and system for 
detecting error during transfer of data signals from a data memory that 
permits correction of the error with fewer penalties to performance of 
the processor. 
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[0010] The above and other objects, which will be apparent to those skilled in 
art, are achieved in the present invention which is directed to a method 
of detecting error during transfer of data signals from a data memory to 
a computer processor comprising initially commencing transmission of 
a raw data signal from a data memory to a computer processor, 
wherein the raw data signal includes an error detection code. At the 
time of the commencement of transmission of the raw data signal from 
the data memory to the computer processor, the method then includes 
checking the raw data signal for corruption based on its error detection 
code. If the data has not been corrupted, the method includes 
completing transmission of the raw data signal to the computer 
processor. However, if the error detection code indicates data 
corruption, the method then includes substituting the raw data signal 
with a predetermined reserved data signal and transmitting the 
predetermined reserved signal to the computer processor. Further, the 
method includes processing the raw data signal or the reserved signal 
with the computer processor. If the error detection code indicates data 
corruption, the method may further include determining if the corrupted 
data in the original raw data signal may be corrected, and subsequently 
retrieving the corrected data and processing the corrected data signal 
with the computer processor. 

[0011] 

If the computer processor processes the predetermined reserved 
signal, the method may also include determining whether corrupted 
data in the raw data signal has been corrected, and, if corrected, 
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subsequently retrieving the corrected data and processing the 
corrected data signal with the computer processor. If the corrupted data 
in the raw data signal has been corrected, the raw data signal in the 
data memory may be replaced with the corrected raw data signal. 

[0012] If the computer processor processes a predetermined reserved 

instruction, the computer processor may execute an error handling 
routine comprising determining whether corrupted data in the raw data 
signal has been corrected, and, if corrected, retrieving the corrected 
data signal, processing the corrected data signal with the computer 
processor, and replacing the raw data signal in the data memory with 
the corrected data signal. 

[0013] Preferably the computer processor operates on timed, uniform clock 
cycles, and the steps of transmitting the raw data signal from the data 
memory to a computer processor, simultaneously checking the raw 
data signal for corruption, and transmitting either the raw or 
predetermined reserved signal to the computer processor, are all 
performed within a single clock cycle. , 

[0014] The method further preferably includes commencing transmission of a 
subsequent raw data signal from the data memory to the computer 
processor and repeating the aforementioned steps for the subsequent 
raw data signal, until all desired raw data signals from the data memory 
are processed by the computer processor. 

[0015] 

The checking of the raw data signal for presence of data corruption is 
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preferably performed simultaneously with the commencement of 
transmission of the raw data signal from the data memory to the 
computer processor. 

[0016] Preferably, the error detection code may comprise ECC code, and the 
raw data signal may be a multi-bit data signal. More preferably, the raw 
data signal is an instruction from a random access memory associated 
with the computer processor. 

[0017] In another aspect, the present invention is directed to a program 

storage device readable by a machine, tangibly embodying a program 
of instructions executable by the machine to perform the 
aforementioned method steps for detecting error during transfer of raw 
data signals from a data memory to a computer processor. 

[0018] | n a f ur th er aspect, the present invention is directed to a system for 

detecting error during transfer of data signals from a data memory to a 
computer processor comprising a computer processor and a data 
memory device containing raw data to be processed by the computer 
processor, wherein the raw data being in the form of a raw data signal 
including an error detection code. The system further includes a data 
checker device adapted to check the raw data for presence of data 
corruption and, if the error detection code indicates data corruption, 
determining if the corrupted data in the original raw data may be 
corrected. The system also includes a computer processor instruction 
unit. The instruction unit is adapted to commence transmission of a raw 
data from the data memory device to the computer processor and 
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cause the data checker device to check the raw data signal for 
corruption via decode of the data and error detection code. The 
instruction unit is further adapted to cause completion of transmission 
of the raw data to the computer processor if the error detection code 
indicates no data corruption or, if the error detection code indicates 
data corruption, cause substitution of the raw data with a predetermined 
reserved instruction and transmission of the predetermined reserved 
instruction to the computer processor. 

[0019] The computer processor, upon processing a predetermined reserved 
instruction, may be adapted to execute an error handling routine 
comprising determining whether corrupted data in the raw data has 
been corrected, and, if corrected, retrieving the corrected data, 
processing the corrected data with the computer processor, and 
replacing the raw data in the data memory device with corrected data. 

[0020] The computer processor may include a clock, such that the processor 
adapted to operate on timed, uniform clock cycles produced by the 
clock. Preferably, the computer processor instruction unii is adapted to 
cause transmission of the raw data from the data memory device to the 
computer processor, the data checker device is adapted to check 
simultaneously the raw data for presence of data corruption, and the 
computer processor instruction unit is adapted to transmit either the raw 
or predetermined reserved instruction to the computer processor, within 
every a cycle. 

[0021] -p ne S y S tem may further include a corrected data register adapted to 
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receive raw data corrected by the data checker device and transmit 
corrected raw data to the computer processor. The data memory is 
preferably a random access memory associated with the computer 
processor and the raw data is preferably an instruction from the random 
access memory for the computer processor. 

BRIEF DESCRIPTION OF DRAWINGS 

[0022] The features of the invention believed to be novel and the elements 
characteristic of the invention are set forth with particularity in the 
appended claims. The figures are for illustration purposes only and are 
not drawn to scale. The invention itself, however, both as to 
organization and method of operation, may best be understood by 
reference to the detailed description which follows taken in conjunction 
with the accompanying drawings in which: 

[0023] Fig. 1 is a schematic view of the preferred instruction memory controller 
of the present invention. 

[0024] Fig. 2 is a schematic view of the instruction memory controller of Fig. 1 
showing additional preferred components in accordance with the 
present invention. 

DETAILED DESCRIPTION 

[0025] In describing the preferred embodiment of the present invention, 

reference will be made herein to Figs. 1 and 2 of the drawings in which 
like numerals refer to like features of the invention. 

[0026] 

In its preferred embodiment, the present invention provides a method 
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and apparatus to allow an instruction on-chip memory unit (l-OCM) 
associated with a computer processor to implement error correcting 
code (ECC) to maximize data integrity while minimizing performance 
overhead. An example of a computer processor is the PowerPC 405 
manufactured by the assignee, International Business Machines 
Corporation of Armonk, NY. The term data is to be understood to 
include instructions to be executed by the computer processor, The 
invention implements ECC code to protect against soft and hard errors 
while providing instructions in a timely manner to avoid stalling the 
processor operation. When an error is detected, the unit replaces the 
corrupt instruction on the fly. This will then allow the processor to 
retrieve the corrected instruction in a subsequent operation. Thus the 
processor only incurs a performance penalty when an error is actually 
encountered rather than on every memory access, and improves the 
classic data integrity vs. performance penalty trade-off-. 

[0027] 

As shown in Fig. 1, an instruction side on-chip memory (l-OCM) unit 10 
provides instructions to the computer processor unit (CPU) 12 via the 
processor's l-OCM interface 1 1 . The purpose of the l-OCM unit is to 
minimize CPU instruction access time by providing a fast path to stored 
code. The l-OCM also provides the capability for code to be read from oth 
data sources over the processor local bus (PLB) slave 18 and to be 
updated on the fly, thus enabling dynamic error detection and correction. 
The corrected data register (DCR) 14 stores instructions or other data tha 
has been corrected. As is standard, CPU 12 includes a clock, and the 
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processor operates on timed, uniform clock cycles produced by the clock. 
Clock 16 and reset 20 inputs are provided to l-OCM controller 10. 
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The interface signals shown in Fig. 1 are described below in Table 1: 



Table 1 



Signal Name 


I/O 


Interface 


Description 










CPUJsocmReqPending 


IN 


CPU 


One of a set of signals which In 
various combinations indicate a 
processor request for one or two 
32-bit instructions. It is asserted by 
the processor to indicate that a 
fetch request is pending in the 
CPU. 


CPUJsocmlcuReady 


IN 




Another in the set of signals used to 
generate a fetch request from the 
processor. When asserted with 
Req Pending, a new fetch request is 
being presented. 


CPU isocmABus(0:29) 


IN 




Address of the instruction(s) being 
requested by the CPU. 


CPU-isocmXIateValid 


IN 




Part of the group of signals used by 
the processor to request an 
instruction fetch. This signal 
normally has an additional function 
to validate storage attributes, 
however, that function is not being 
used in this design. 


CPUJsocmAbort 


IN 




Indicates that the currently 
outstanding fetch request is being 
aborted. No RdDValid bits may be 
asserted for the associated request 
after the cycle in which the Abort is 
asserted, If ReqPending is asserted 
along with Abort, a new fetch 
request is being presented. j 


ISOCM<puHoid 


OUT 




The Hold signal is asserted by the 
ISOCM to indicate to the processor 
that it is working on its request, but 
unable to complete the transfer 
during the current cycle. 


ISOCM<puRdDValid(0:1) 


OUT 




Validates data words presented to 
the processor on RdDBus(0:63) 
lines. 

Bit 0: Even word data valid an 
RdDBus(0:31) 

Bit 1 : Odd word data valid on 
RdDBus (32:63) 


ISOCM-cpuRdDBus(0:63) 


OUT 




64-bit data bus used to transfer 
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instructions to the CPU. 


SLV isocmRd Req 


IN 


PLB 


Read request signal 


SLV isocmWr Req 


IN 




Write request signal 


SLV isocmRd Abus(0:31) 


IN 




Read address 


SLV isocmWr Abus(0:31) 


IN 




Write address 


SLV isocmRd BE (0:3) 


IN 




Read byte enables 


SLV isocmWr BE(0:3) 


IN 




Write byte enables 


SLV isocmRd Nwords(0:3) 


IN 




Read number of words 


SLV isocmWr Nwords(0:3) 


IN 




Write number of words 


SLV isocmWr Data(0:31) 


IN 




Write data bus 


SLVJsocmSampleCycleL2 


IN 




PLB Sample Cycle to synchronize 
PLB activity when running at higher 
frequencies 


ISOCM sIvRd Ack 


OUT 




Read Acknowledge 


ISOCM slvWr Ack 


OUT 




Write acknowledge 


ISOCM sIvRd Data(0:31) 


OUT 




Read data bus 


ISOCM sIvTimeout R 


OUT 




Array access error detected on read 


ISOCM sIvTimeout W 


OUT 




Array access error detected on write 


ISOCM sIvTargetAdr 


OUT 




Target Address of read data 


PGM dcrAddr(0:5) 


IN 


DCR Bus 


Programmable DCR unit select 


CPU dcrRead 


IN 




Indicates that the CPU is requesting 
to read data 


CPU-dcrWrite 


IN 




Indicates that the CPU is requesting 
to write data 


-CPU_dcrABus(0:9) 


IN 




Indicates the address of the DCR 
which is being accesses for a read 
or write DCR operation 


XXX_dcrData(0:31) 


IN 




The 32-bit data bus output to 
transfer write data from the CPU. 
This bus also drives 0x00000000 
during DCR read operations 


ISOCM_dcrAck 


OUT 




Indicates that the DCR data is 
available on 

ISOCM_cpuDBusin(read), or that 
the new DCR data is latched (write) 


ISOCM<icrData(0:31) 


OUT 




The 32-bit data bus used to transfer 
read data from a DCR to the CPU 
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[0028] Fig. 2 shows details of the preferred system, wherein l-OCM unit 10 
contains a large static random access memory (SRAM) 24 or other 
data memory device for instruction or other data code storage which is 
accessible for reads from CPU 12, and for both reads and writes over 
PLB 18. The l-OCM control unit 10 supplies instructions from the 
instruction SRAM 24 to CPU 12, via fetches over the CPU l-OCM 
interface 1 1 . Normally, these instructions are the raw, uncorrected 
output of the SRAM, which provides the best performance path. 

[0029] CPU transfers require valid requests before CPU instruction fetches 

may be made. During the CPU instruction fetches, the preferred CPU I- 
OCM interface 1 1 handles data as 64 or 32 bit transfers. The 72 bit 
SRAM data word width includes 64 data bits and 8 ECC check bits. In 
normal operation, raw, uncorrected SRAM data (minus the check bits) 
are transferred during a CPU access in order to minimize the number of 
cycles required for an instruction fetch. While SRAM 24 is retrieving the 
data, the l-OCM unit 10 signals its intention to provide the instruction to 
CPU 12 in the next cycle, to minimize the instruction access time. 
When the data is available from the SRAM 24, the instruction unit 
causes transmission of the raw data 29 from SRAM 24 to the MUX 28 
and error correcting code, generation checking (ECC GEN/CK) unit 26 
to test for data corruption, for example due to a soft error in SRAM 24. 
However, if the data is corrupt, there is not enough time to correct it, 
since the system is preferably optimized for the non-error case. 

[0030] 

In accordance with the present invention, l-OCM unit 10 continues to 
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provide an instruction within the same CPU clock cycle, even when the 
instruction read from SRAM 24 is corrupt. If an ECC error is detected in 
the data fetched from the SRAM, instead of passing on the corrupt 
instruction, l-OCM unit 10 swaps in a predetermined error-indicating 
signal or instruction 30 in place of the retrieved corrupted data, and 
presents it to CPU 12 via _the multiplexer (MUX) 28, also within the 
same CPU clock cycle. The predetermined error-indicating signal is a 
properly formed but reserved instruction or operation code. An attempt 
is also made at the same time by ECC GEN/CK 26 to determine if the 
error is correctable, and then to correct the corrupted data by known 
correction techniques. Any corrected data 31 which has been 
processed by ECC GEN/CK 26 will then be stored in the l-OCM 
corrected data register (DCR) 14. 

[0031] If and when computer processor 12 attempts to execute the 

predetermined reserved error-indicating signal or reserved op-code 30, 
an interrupt is taken. The interrupt handler preferably then proceeds as 
follows with an error handling routine: 

[0032] 1 . Determine if the error was correctable by reading the l-OCM status 
register. 

[0033] 2. If the error was correctable, read the corrected data or instruction 31 
from the l-OCM corrected data register (DCR) 14 via the DCR bus or 
interface 32. Then write the corrected data or instruction 31 back to 
SRAM 24, to substitute for the corrupted raw data 29, via a PLB 18 
write (discussed further below). 
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[0034] 3. If the error was not correctable, retrieve the correct original data or 
instruction from a back-up memory area attached to the PLB. 

[0035] 4. Continue CPU 12 processing with the corrected instruction. 

[0036] The predetermined error-indicating signal that is substituted for the 
corrupt data signal or instruction therefore serves several purposes. 
First, it prevents a protocol violation on l-OCM interface 1 1 by providing 
an instruction when one was promised. Second, if the instruction is 
never executed, due to an immediately preceding jump instruction, 
there is no delay in continued processor execution, though the SRAM 
may still be corrected later if desired. Finally, when the substituted 
instruction is executed, it preferably causes processor 12 to execute an 
error handling routine which can retrieve the corrected instruction 
stored by l-OCM unit 10, update SRAM 24, and continue execution of 
the instructions. Though this may take multiple cycles, the performance 
penalty is only paid when an error is encountered. Amortized over 
millions of instructions, the penalty for vastly improved data integrity is 
minimized. 

[0037] For PLB-originating writes of the corrected data or instruction, l-OCM 
controller 10 provides the capability to load code into SRAM 24 via PLB 
18. However, since the PLB Slave 18 interface is only 32 bits, each of 
these writes must actually be a read-mod ify-write to the 72 bit wide 
ECC-protected SRAM. The read-modify-write will basically be 
transparent to the PLB interface. l-OCM controller 10 may perform all of 
the steps necessary for the write as follows: 
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[0038] 1. Latch in the PLB address bus and 32 bit write data. 

[0039] 2. Wait until any currently executing l-OCM interface 1 1 request has 
completed. 

[0040] 3. Initiate an SRAM read of raw data 29 from the specified address. 

[0041] 4. Wait for ECC-corrected data 31 to be generated and stored in the 
corrected data register, DCR 14. 

[0042] 5. Using switch 34, combine the 32 bit write data from PLB 18 with the 
complementary half of the corrected data 31 from corrected data 
register, DCR 14, via latch 36. 

[0043] 6. Initiate an SRAM write, which will regenerate the ECC check bits and 
store the new 72 bit double word into the SRAM. 

[0044] The read-mod ify-write operation will likely take multiple CPU clock 
cycles, so subsequent PLB accesses to the instruction side of SRAM 
24 may be delayed. 

[0045] A PLB read of data or instruction from instruction side of SRAM 24 is 
somewhat simpler than a write, requiring l-OCM controller 10 to 
perform the following steps: 

[0046] 1 . Latch in the PLB address bus. 

[0047] 2. Wait until any currently executing l-OCM interface 1 1 request has 
completed. 

[0048] 3. Initiate an SRAM read of raw data 29 from the specified address. 
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[0049] 4. Wait for ECC-corrected data 31 to be generated. 

[0050] 5. Choose the upper or lower half of the corrected data via MUX 38 
depending on the requested address. 

[0051] A PLB read from l-SRAM will likely take multiple CPU cycles to 
complete. 

[0052] In the ECC operations, l-OCM controller 10 preferably utilizes a (72,64) 
Error Correcting Code to improve the reliability of the Instruction SRAM. 
Eight check bits are appended to the 64 data bits to provide single error 
correction and double error detection. All occurrences of both 
correctable (single bit) and uncorrectable (multiple bit) errors will cause 
the appropriate status / interrupt bits to be set in the l-OCM 
status/interrupt register. The l-OCM Interrupt will be asserted only when 
an uncorrectable error has been detected as a result of either a PLB or 
OCM interface request. 

[0053] Thus, the present invention to provide an improved method and system 
for detecting instruction error during transfer of data signals from a data 
memory to a computer processor. The invention permits the processor 
to proceed whether or not an error is detected in the accessed 
instruction, and the processor is only delayed when an error-indicating 
signal or instruction is actually executed. The method and system of the 
present invention therefore maximizes data integrity while permitting 
correction of the error with fewer penalties to performance of the 
processor. 
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[0054] While the present invention has been particularly described, in 

conjunction with a specific preferred embodiment, it is evident that 
many alternatives, modifications and variations will be apparent to 
those skilled in the art in light of the foregoing description. It is therefore 
contemplated that the appended claims will embrace any such 
alternatives, modifications and variations as falling within the true scope 
and spirit of the present invention. 

[0055] Thus, having described the invention, what is claimed is: 
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